PredictSNP2: A Unified Platform for Accurately Evaluating SNP Effects by Exploiting the Different Characteristics of Variants in Distinct Genomic Regions

نویسندگان

  • Jaroslav Bendl
  • Milos Musil
  • Jan Stourac
  • Jaroslav Zendulka
  • Jirí Damborský
  • Jan Brezovsky
چکیده

An important message taken from human genome sequencing projects is that the human population exhibits approximately 99.9% genetic similarity. Variations in the remaining parts of the genome determine our identity, trace our history and reveal our heritage. The precise delineation of phenotypically causal variants plays a key role in providing accurate personalized diagnosis, prognosis, and treatment of inherited diseases. Several computational methods for achieving such delineation have been reported recently. However, their ability to pinpoint potentially deleterious variants is limited by the fact that their mechanisms of prediction do not account for the existence of different categories of variants. Consequently, their output is biased towards the variant categories that are most strongly represented in the variant databases. Moreover, most such methods provide numeric scores but not binary predictions of the deleteriousness of variants or confidence scores that would be more easily understood by users. We have constructed three datasets covering different types of disease-related variants, which were divided across five categories: (i) regulatory, (ii) splicing, (iii) missense, (iv) synonymous, and (v) nonsense variants. These datasets were used to develop category-optimal decision thresholds and to evaluate six tools for variant prioritization: CADD, DANN, FATHMM, FitCons, FunSeq2 and GWAVA. This evaluation revealed some important advantages of the category-based approach. The results obtained with the five best-performing tools were then combined into a consensus score. Additional comparative analyses showed that in the case of missense variations, protein-based predictors perform better than DNA sequence-based predictors. A user-friendly web interface was developed that provides easy access to the five tools' predictions, and their consensus scores, in a user-understandable format tailored to the specific features of different categories of variations. To enable comprehensive evaluation of variants, the predictions are complemented with annotations from eight databases. The web server is freely available to the community at http://loschmidt.chemi.muni.cz/predictsnp2.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Pattern of Linkage Disequilibrium in Livestock Genome

Linkage disequilibrium (LD) is bases of genomic selection, genomic marker imputation, marker assisted selection (MAS), quantitative trait loci (QTL) mapping, parentage testing and whole genome association studies. The Particular alleles at closed loci have a tendency to be co-inherited. In linked loci this pattern leads to association between alleles in population which is known as LD. Two metr...

متن کامل

Imputation of parent-offspring trios and their effect on accuracy of genomic prediction using Bayesian method

The objective of this study was to evaluate the imputation accuracy of parent-offspring trios under different scenarios. By using simulated datasets, the performance Bayesian LASSO in genomic prediction was also examined. The genome consisted of 5 chromosomes and each chromosome was set as 1 Morgan length. The number of SNPs per chromosome was 10000. One hundred QTLs were randomly distributed a...

متن کامل

Molecular Modeling and Docking Studies on the First Chlorotoxin-Like Peptide from Iranian Scorpion Mesobuthuseupeus (Meict) and SNP Variants of Matrix Methaloproteinase-2 (MMP-2)

Background: MeICT is the first chlorotoxin-like peptide isolated from the Iranian Scorpion Mesobuthus eupeus. Chlorotoxin (CTX) is a neurotoxin that specially binds to (MMP-2) on ma-lignant cells and now is used in treatment of glioma. In the present study, we have used homology modeling to propose the 3D structure of MeICTand analyze its interaction with MMP-2 and its SNP types. Methods:The ...

متن کامل

P-241: Association of ITPA Polymorphisms rs1127354 with Infertility

Background: Infertility is a relatively common problem that affects couples worldwide. It is estimated that approximately 1 in 6 couples will experience difficulties in reproducing, defined as a failure to conceive after two years of unprotected sexual intercourse. The molecular and genetic factors underlying the cause of infertility remain largely undiscovered. ITPA is an inosine triphosphatas...

متن کامل

Impact of Genetic Variants in Mir-122 Gene and its Flanking Regions on Hepatitis B Risk

MicroRNAs are small non coding RNAs that are involved in gene expression regulation. Mir-122 was reported to inhibit hepatitis B virus (HBV), but little is known about the role of mir-122 polymorphisms on HBV infection development. This present study aimed to investigate the association between single nucleotide polymorphisms (SNPs) in mir-122 gene region with HBV infection. Study cases were HB...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 12  شماره 

صفحات  -

تاریخ انتشار 2016